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This paper studies the tree partitioning of a graph whose definition is based on the pattern of 
zero elements present in a given matrix. This partitioning then indicates a particularly advantageous 
strategy for employing block Gaussian elimination over a certain class of matrices. The strategy is 
exploited for matrix inversion, where it is especially appropriate for problems which require finding 
only selected submatrices of the inverse. A graph-theoretic algorithm is given for automatically 
generating tree partitionings for any matrix. Combinatorial properties of this procedure are also 
discussed. 
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1 . Introduction 

The solution of "sparse" systems of linear equations has reeeived considerable attention in 
recent years [13] \ [15], [19], and justifiably so, since coefficient matrices having relatively few 
nonzero entries arise quite frequently from physical problems [1], [4], [8]. The object of this paper is 
to discuss a partitioning method for inverting such sparse matrices which can rather readily adapt to 
the given zero-nonzero structure of a particular sparse matrix. This work is motivated by the 
observation [12] that the solution of a linear system is particularly simple when the graph underlying 
the coefficient matrix is exactly a tree. The present approach shows how a natural tree-like 
representation can be obtained for any matrix and how this leads to an especially simple method for 
inverting the given matrix. In addition, a straightforward extension of the method allows one to 
solve, rather effectively, systems of linear equations. 

Henceforth, our attention will be focused on the class K of square matrices all of whose 
principal submatrices are nonsingular. Inasmuch as the class K contains all positive definite 
symmetric matrices (which do arise quite often in physically meaningful contexts), the method to be 
described will be applicable to a number of important situations. It is not necessary that the 
coefficient matrix A be symmetric; in fact, nonsymmetric matrices for which A + A T is positive 
definite also belong to K, as well as the nonsymmetric M-matrices [6] (having nonpositive off- 
diagonal elements and all principal minors positive). 

2. Tree Partitionings 

Let the real n x n matrix A be in K. Eventually, a simultaneous permutation of the rows and 
columns of A will be produced so that the resulting matrix has an especially simple form with 
identifiable (and exploitable) blocks of zero entries. To do this, we first associate a finite undirected 
graph Ga with the matrix A = (ay). The nodes of G A correspond to the rows/columns of A, and an 
edge U\ j) will join node i to node j whenever |a„| + |a j? | > 0. 

In any graph C, the sequence of edges (e u e 2 , . . ., e k ) forms a chain whenever k + 1 nodes i , 
ii, . . ., i k can be identified so that Cj joins nodes ij- X and ij for j = 1, . . ., k; a chain is elementary 
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if it does not meet the same node twice. A chain which consists of at least three distinct edges and 
which joins a node to itself is termed a cycle. If every pair of distinct nodes in a graph is joined by a 
chain, then the graph is said to be connected. A tree is a connected graph which contains no cycles. 
It is now supposed that the nodes /V of the graph G A are partitioned into m > 2 distinct sets of 
nodes N u N 2 , . . ., N m which exhibit a tree structure when viewed as an undirected graph T A . 
Namely, the nodes of T A are the sets N h N 2 ,. . ., /V m and an edge joins N t with Nj (i = j) in T A 
whenever an edge exists in G A between some node in /V, and another node in Nj. The graph T A so 
constructed is assumed then to be a tree. It will be shown in section 4 how to construct for 
connected graphs such a tree partitioning. (If the graph G A is not connected then clearly A is 
decomposable as a block diagonal matrix; in this case the inverse of A is readily computable from 
the inverses of the diagonal blocks, each of which corresponds to a connected graph.) As an 
example, the "snowflake" graph [14, p. 209] depicted in figure 1 has the tree partitioning given in 
figure 2. 




Figure 1. An illustrative graph. 
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FIGURE 2. A tree partitioning for the graph of figure I (the nodes comprising each node set are also shown). 

The concept of a tree partitioning has been previously applied [16] to shortest path calculations 
in large sparse networks. This concept can also be applied to the present situation of matrix 
inversion. The basic idea is that of decomposing the original problem into smaller subproblems, the 
solutions to which can be easily recombined to yield the information required from the original 
problem. 

The present decomposition approach relies on a fundamental property of trees. This is, in each 
tree there always exists a pendant node — a node which is incident with precisely one edge. When 
such a pendant node i and its incident edge are deleted from a tree T, then a new tree T — i is 
formed. This process of deleting a pendant node together with its incident edge can then be 
repeated anew using the tree T - i. Therefore, given the tree T A , the node sets can be suitably 
relabelled so that N l is a pendant node of T A , N 2 is a pendant node of T A — N u . . ., and /V w _! is a 
pendant node of T A — N { — ... — /V w _ 2 . Such a labelling of T A is shown in figure 2. Moreover, for 
each index i £ m there is associated a unique index r(i) such that N ni) is the next node set after /V, 
on the unique elementary chain joining N t and N m in the tree T A . Because of the assumed labelling 
induced by the pendant node sets of T A , it is always the case that r(i) > i. Table 1 displays these 
indices for the tree of figure 2. 



247 



Table 1. The indices r(i)for the tree of figure 2. 



i 


1 


2 


3 


4 


5 


6 


7 


8 


9 


r(i) 


3 


3 


10 


6 


6 


10 


9 


9 


10 



3. The Matrix Decomposition Algorithm 

In this section, a general decomposition algorithm for inverting matrices A e K will be presented 
and justified. This algorithm is based on a tree partitioning N = (N u N 2 , . . ., N m ) of the associated 
graph G A . Let the matrix A be partitioned as A = (/4 y ), where Ay is the submatrix composed of 
entries a uv where u e N t and v eNj. Let A' 1 be similarly partitioned as A' 1 = (.¥„). It is assumed 
that the node sets have been suitably relabelled in accordance with the scheme described in section 
2. Hence, the ah row ofA,i =£ ra, is given by 

(3.1) A { = (An,- . .,Au, ,. . ., 0,A Ur(ih ,. . ., 0), 

where the submatrices A^ = (k < i) unless r(k) = i. Also, the rath row of A is given by 

(3.2) A m = (A m ,. . .,A mm ), 

where the submatrices A mk = (A: < ra) unless r(k) = ra. The proposed decomposition algorithm for 
obtaining^ -1 horn A is embodied in the following procedure. 



PROCEDURE INVERT. 

1. Set Ba = An, i = 1,. . ., m, 

2. For i = 1,. . ., ra — 1 calculate 



(3.3) 
(3.4) 



Bi,r(i) ~ £>ii ^M.rO') 

£>r(i),i = ~ A ni)> i b n 



and perform the replacement £*«,**) : = B^d^o + A myj B itriD . 

3. Compute X mm = B^m and for i = m — 1,. . ., 1 

*i,r(i) — £>i,r(i)X r (i),r(i) 
X r (i),i — A r(i),r(i) " r(i),i 

%ii = Bii 1 + Bi,r(j) XriDj. 

4. The remaining yVy blocks can be calculated using 

%ij = ^i,r(i)^r(j),j i < j 

%ij ~ % UrU) " r(j)j l >J- 

It is noted that this procedure requires the inversion of submatrices B it as well as the 
multiplication and addition of appropriate submatrices B jk . Of course, there is no reason why the 
submatrices B u cannot be inverted (if still sufficiently sparse) by further applying the decomposition 
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algorithm to the subnetworks based on individual node sets TVj. Such an approach can be recursively 
applied until the sparsity of the fi^'s has been exploited to its fullest. 

The validity of the procedure will now be established. First, note that at the end of Step 2 of 
INVERT, 

(3.5) &u = A n+ X A ikB ki . 

r(k)=i 

Since A~ x = X = (£#) is the solution to 

(3.6) A X = /, 

then from (3. 1) with i 4 1 j , i < m 

(3.7) 1 A ik X kj + AuXis +A Urii) X rii)J = 0. 

nk) = i 

It is now claimed that 

(3.8) X u = Bi fr a)Xni)j if i <j. 

This assertion will be established inductively. Inasmuch as i = r{k) > k, equation (3.5) shows that 
#n = A u . Thus, using (3.7) withy > 1 

A\\X\j + A i,r(lK» r(l)J = U, 

whence from (3.3) 

X\j — ~ A u A ltra) X nl) j = B i, r (i)A r(l)J- 

Therefore, the assertion (3.8) holds for i = 1. Suppose the assertion is true for indices k < i, where 
i < m. Then from (3.7) with i <j 

2* A ik B k iXij + A aX u + A j, r a)X r(J) j = 0, 

r(k)=i 

since k < r(k) — i <j and so the inductive hypothesis can be applied. By (3.5) the above relation 
becomes 

t>uXjj -+■ A i,r(i)X r (i)j = 0, 

whence 

X ij — "Oji j4 i,r(i)X r(i)J = B i, r (i)X r( /)j, 

using (3.3). Accordingly, the assertion (3.8) is established by induction. 
In particular, choosing j = r(i) in (3.8) yields 

(3.9) X i, ni) — />i,r(i>^r(0,r(/) " I < 171. 
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Furthermore, from (3.2) and (3.6) 



or, applying (3.9), 



2* A mkS* km ' **. mm A mm 1 

r(k)=m 



2* A mktt km^ mm ' ** mmA mm *> 

r(k)=m 

so that from (3.5), B mm X mm = /, whereupon 

(o. 1U) A mm D mm . 

Finally, from (3.1) and (3.6) with i =£ m 

Z, A ik X ki + AuXii 4- A irifiX wf),i — / 

r(k)=i 



or, using (3.8) and (3.5), 



DjjXa + /4i jr( ,-)A r (,),,- — I. 



By virtue of (3.3), 



(3.11) 



X a — t> a t> it A i, r (i)X rii)>l 



The counterparts to equations (3.8) and (3.9) can be established by using the relation X A = I 
instead of (3.6), thus yielding 



(3.12) 
(3.13) 



%ii ~ % i,r(j)KrU)J " l > J 

X r (i),i = X r(i),r{i)£> rii),i it I < 171 



Together, eqs (3.8) — (3.13) serve to establish the validity of procedure INVERT, provided that the 
submatrices B it encountered for inversion are nonsingular. This property of the B^s will next be 
shown to follow from the fact that A e K. First, given any tree partitioning of G A , the matrix A = 
(A^ e K evidently satisfies the following condition: 

Block Principal Submatrix (BPS) Condition. Given the matrix A partitioned into m 2 blocks by 
corresponding sets N u N 2 , . . ., N m of row and column indices, then any submatrix determined by 
deleting corresponding sets Nj (possibly none at all) is nonsingular. 

It will now be shown that, under the transformation described by Step 2 of INVERT, this 
property is inherited by certain submatrices of the transformed matrix. Let A = (A ) be written as 



A =M' 



B n 


o • 


' * ^l,r(l) 


• 











4 r(l),l 




M, 




6 
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After the transformation with i = 1 in Step 2 of INVERT, the matrix M x will be transformed into a 
new matrix MJ which differs from M 1 only in that the diagonal block B riX)>nu has been replaced by 



£>Hl),r(l) ~ £>r(l),r(l) + A r(l),l ^M,r(l) 



(3.14) 



= B r 



A r(l),l^ll ^l.r(l)' 



Now it is easily verified that M\ = (M'JB u ), the Schur complement [2] of #n in M' , defined by: 



M 



A B 
C I) 



=> (MZ4) = D - CA~ l B. 



Given that A = M' is BPS, it is claimed that M\ is also BPS. In fact, consider any principal 
submatrix S' determined by the blocks of M[. If the diagonal block B' nlhra) does not occur in S', 
then S' is also a principal block submatrix of MJ, and hence is nonsingular. If S' does contain 
B'r(i),ra)i then consider the corresponding submatrix S of M' obtained by replacing B' raura) by its pre- 
transformation value B nl)ynl) . Let T be the submatrix of Mq defined by 



r b u 


• • 


A\,r(i) 


■ 











^ r(l),l 




S 




. o 









Since T and B u arc principal block subniatriccs of \I' {) (which is BPS), then F and /^ n are 
nonsingular. Moreover, by the determinantal formula of Schur [7] 

det T = det B u - del (7Yfl u ), 

it follows that (TIB U ) is nonsingular as well. However, it is easily seen from (3.14) that {TIB U ) = S' 
and so S' is nonsingular. Finis, M', is BPS. 

More generally, the above argument can be repeated to show that \{ M' k _ x is BPS then the 
submatrix M k defined by 



M' k . x = 



B kk 


. 


. .A k 


,r(k) - ■ 


. 













d r (k),k 






M k 
















yields a transformed M' k (after executing Step 2 of INVERT with i = k) which is also #/\S. Given, 
then, that each M k is BPS we can conclude that in particular the diagonal block B k+uk+l is 
nonsingular. This is precisely what is needed to ensure that at each iteration of Step 2, the matrix 
Bji 1 is defined. Accordingly, the procedure INVERT is guaranteed to be well-defined and will then 
produce the matrix A~ l as required. It should be noted that if A satisfies the BPS condition for the 
given partition, then the INVERT procedure will remain valid, even if A is not a member of K. 

In effect, procedure INVERT is a statement of block Gaussian elimination, which becomes 
especially simple and manageable in the context of a tree partitioning. It should be noted that in 
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Steps 2 and 3 the blocks of the inverse matrix (corresponding to nonzero blocks of the original 
matrix A) can be calculated without introducing any new "fill-ins"; this is just to say that the given 
zero blocks are preserved. Moreover, we see that the existence of a tree partitioning allows one to 
solve linear equations in very much the same way as inverting matrices. Rather than needing to 
perform submatrix inversions, we need instead to solve a linear subsystem of equations at each step. 

4. Finding Tree Partitionings 

In this section, an algorithm will be described for obtaining tree partitions of an undirected 
graph G = (N,E). Without loss of generality, it may be assumed that G is a connected graph having 
at least two nodes. Indeed, any graph which is not connected can be resolved into at least two 
maximally connected subgraphs 2 or connected components. Clearly, then, the tree partitioning 
problem can be studied and solved separately with regard to each of these connected components. 

The basic idea of the algorithm can be most easily explained by means of an example. 
Consider, therefore, the connected graph of figure 1 for which N = {1, . . ., 18}. The process is 
begun by selecting a subset TVi of nodes from C\ = N with (j) C /Vi C C\ ; 3 say, N t = {4, 7, 10}. It 
is then easy to compute 

rWi) = {j e d - Nil {i, j) e E for some i e NJ, 

the set of all nodes in C\ adjacent with (but not including) nodes in N x . For this example, Y{N{) = 
{2, 3, 6, 8, 9, 11, 12, 15, 16}. It is also easy to find the connected components of the subgraph 
based on the nodes C\ — N\ and thus the node sets C k constituting each of these connected 
components. Here, C 2 = {8, 9, 13, 14, 15}, C 3 = {11, 12, 16, 17, 18} and C 4 = {1, 2, 3, 5, 6}. The 
node sets N 2 , /V 3 , and N 4 are then chosen as N 2 = C 2 O r(N t ) = {8, 9, 15}, N 3 = C 3 n T(N X ) = 
{11, 12, 16}, andiV 4 = C 4 n IWi) = {2, 3, 6}. 
The next iteration of the process computes 

TW 2 ) = {j € C 2 - N 2 : (i, j) € E for some i e N 2 } 
= {13, 14} 

and determines the sets C 5 = {13} and C 6 = {14} associated with the two connected components 
for C 2 - /V 2 . The node sets /V 5 and N 6 are given by N 5 = C 5 O l\N 2 ) = {13}, and /V 6 = C 6 f T(N 2 ) 
= {14}. In similar fashion, subsequent iterations produce /V 7 = {17}, 7V 8 = {18}, /V 9 = {1}, /V 10 = 
{5}. The tree partitioning which reflects this disposition of node sets (apart from relabelling) is 
shown in figure 2. 

A general statement of the tree partitioning procedure for connected graphs G = (N 9 E) is 
provided by 

PROCEDURE TREEPART. 

1. Let N t satisfy $ C N t C N. Set d = N, p = 1, q = 1. 

2. Compute F(N P ) = {j e C p - W p : (f, j) e E for some i e N p }. If IW„) = (f> then go to Step 

3. Otherwise, 

a. Find the node sets C q+U . . ., C q+r constituting the connected components of the 
subgraph determined by C p — N p . 

b. Form N q+i = C q+i O F(N P ) for i = 1, . . ., r. 

c. Join node set N p by an edge to each of the node sets N q +u • • ., ^ Q+r . 

d. Let q : = q + r. 

3. Letp : = p + 1. If p < q then go to Step 2. Otherwise, terminate. 

In the above algorithm, the actual edges of the tree formed from the node sets N k are generated at 

2 A subgraph of a given undirected graph G = (N,E) has for its nodes a suhset N C N and contains those edges of £ which join nodes in AV 

3 The notation A C B means that A is a proper subset of B. 
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Step 2c. Because Ni is a nonempty proper subset of /V, the resulting tree partitioning has always at 
least two and at most /i = 1 7V| constituent node sets. Generally speaking, the less the density of 
edges in G, the greater will be the number of node sets in a tree partition for G. (This statement will 
be given a more precise form in the theorem which appears later in this section.) 

The fact that G is connected ensures that each node of G will appear in some node set N k . 
Moreover, the connectivity of G is also enough to guarantee that each of the node sets generated in 
Step 2b is nonempty. Indeed, suppose that N p £ c/> and, say, N Q+l = <fi. This means that no edges 
exist between nodes in C q + X and nodes in N p . Since the nodes of C q +i form a connected component 
relative to the nodes of C p — N p , then it follows that C Q+l forms a connected component of C p . 
Because N p =/= </>, then C Q+1 C C p and thus one connected component properly contains another. 
This is a manifest contradiction, and so N p £ <fi must imply that /V Q+1 =£ (/>,..., N Q + r =£ (/). In view 
of the fact that N\ 4=- (/), this observation can be repeatedly applied to demonstrate that all node sets 
N k are nonempty. Since the N Q+1 , . . ., N Q+r defined in Step 2b are nonempty, the edges generated 
in Step 2c correctly reflect the adjacency relations between the node sets in G. In addition, the fact 
that no two distinct node sets N Q+h N q+j formed in Step 2b are connected by a chain in C p — N p 
(else C q +i and C Q +j would be connected in C p — N p ) is enough to guarantee that the procedure 
always generates a tree graph for the node sets N k . Accordingly, it has been verified that 
TREEPART does indeed produce a tree partitioning for every (nontrivial) connected graph G. 

It is interesting to note that when applied to a graph which is itself a tree this procedure yields 
the original tree back again whenever Ni is chosen to consist of a single node. In general, different 
choices for the initial node set TVi will lead to different tree partitionings. Thus, TREEPART allows 
one to obtain a whole range of different tree partition structures for the same graph G. In order to 
obtain a "good" decomposition of the original network (e.g., one containing a large number of node 
sets), a reasonable heuristic might be to choose an N x which consists of relatively few nodes and 
which itself forms a highly connected subgraph. It often seems desirable to choose N\ so that N — 
N\ has several connected components. 

As noted above, the number m of node sets in a tree partitioning is at least 2 and at most n. A 
more precise upper bound on the number of node sets is given by the following result. 

THEOREM: If G = (N,E) where |N| = n, |E| = k then the number m of node sets in a tree 
partitioning of G satisfies 



m < 



3 + \/4n 2 - 4n - 8k + 1 



2 



where [ ] is the greatest integer function . Moreover, this upper bound is actually tight; that is, there 
exist graphs with n nodes and k edges for which equality obtains. 

First, it will be convenient to prove the following lemma 4 . 

Lemma: For a given tree structure T with node sets N t , N 2 , . . ., N m , the maximum number of 
edges possible in any underlying graph G is achieved when |Nj| = 1 for i =£ p and |N„| = n — m + 
1, where N p is a node set of maximal degree in T. 

PROOF: Given the tree structure F, regard the quantities n f = | TV ,| as integer variables 
constrained by 

m 

(4.1) Uj > 1, y n f = n. 



4 A referee has pointed out a simplified proof of the theorem which does not use this lemma. However, because the lemma refers to a more general problem 
and is of independent interest, we include its statement and proof. 
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For given rc/s, the maximum number of edges in the underlying graph G occurs when each N t 
corresponds to a complete subgraph, and when for each (N i9 Nj) e T all of the possible n { rij edges 
between the corresponding subgraphs of G are actually present in G. This maximum number is given 
by 

1 m 
F(n) = F(n u . . ., n m ) = - £ n t (n t - 1) + 2 { n^: (N u Nj) e T}. 

^ i=l 

We therefore address the problem of maximizing F(n) subject to (4.1). Let n be an arbitrary 
assignment of weights to the nodes satisfying (4.1). It will be shown that there exists an assignment 
to the nodes of weights n° having the form 

(4.2) n t = 1, i ± p 

n p = n — m 4- 1, 

with N p a node set in T of maximal degree, such that F(n) < F(n°). 

In order to show this, consider the following procedure which performs a reassignment for some 
current assignment n relative to the edge (N i9 Nj) e T. 

PROCEDURE REASSIGN (i 9 j). 

1. Let w t be the sum of weights n s for all node sets N s adjacent to node set N ( {s =£ j) and let 
Wj be the sum of weights n s for all node sets N s adjacent to node set Nj {s + i). Define r = Ui 4- 
rij. 

2. If w t > Wj then set n\ = r — 1, n\ = 1. Otherwise, set n[ — 1, Aij = r — 1. 

It is clear that for this new assignment n', where only n t and n 5 have been changed, equation (4.1) is 
satisfied. Moreover, it will now be shown that F(n) < F(n'). Indeed, if / = n t and J = nj then 

F(n) = a + -{/(/- 1) + JU ~ 1)} + IJ + /^i + /wj, 

where a is independent of/ and J. Or, setting / 4- J = r, 

F(n) = a 4- - r(r - 1) 4- /w* + (r - I)tVj = /B 4- 7(w, - w^-), 

where /3 is independent of/, /. Thus, if w^ > Wj it is clearly advantageous, in the sense of increasing 
F(n), to make / as large as is feasible; namely, / = r — 1. Similarly, if m^ < i^ then setting J = r - 
1 will increase F(n) as much as possible. Since this is precisely what is being done in REASSIGN, 
one has F(n) < F{n'). 

Suppose that procedure REASSIGN is performed so long as there are adjacent nodes A^j and Nj 
for which n t > 1, rij > 1. Consider then the set P of node sets N t for which n, > 1. Note that the 
lemma certainly holds when P = <£. If |P| > 2, it will be demonstrated that a further reassignment 
of node weights rii can be made which does not decrease F(n) but which will reduce \P\. 

Assume, then, that A^ and Nj are in P. Thus (N u Nj) i T since otherwise procedure REASSIGN 
could have been employed. In fact, each node set N k adjacent with N t must have weight n k = I, and 
similarly for Nj. Let d { be the degree of node set N h and dj the degree of Nj. Also, set / = n h J = 
rij and r = I + J. It is claimed that F(n) cannot be decreased by performing the reassignment 

(4.3) 



Hi = 1 , rij = r — 1 


if dj > d, 


rij — I, rii = r — 1 


if rfj < d t 
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Indeed, 



F(n) = a + - r(r - 1) - IJ + /rf, 4- ,/rfj 

= a + - r(r - 1) - /(r - /) + Id t + (r - /)dj 

= /3 + / 2 + (d< - dj - r)/, 

where a, /3 are independent of/, J. Because g(/) = I 2 + (c/; — dj — r)/ is a convex function of/, it 
is maximized over the interval [1, r— 1] at one of the endpoints. It is easy to verify that 

g(l) > g(r - 1) <=> (dj - rffXr - 2) > <=> dj > d„ 

since r = I + J > 2. Therefore, the reassignment given in (4.3) cannot decrease the value ofF(n). 
In other words, given node sets /V,, /Vj e P, one of the two sets can be made to have weight 1 
without decreasing F, and so can be removed from P. By continuing this procedure, P eventually 
reduces to a single node set with weight w > 1. Finally, by using REASSIGN and (4.3) as necessary 
with respect to the node set /V ;> of weight w > 1 and each node set of weight I, it will be assured 
that the resulting N p is in fact a node of maximal degree d p in T. Thus, the assignment which is 
ultimately produced is of the form (4.2). Since the value F{n) is never decreased throughout the 
process, we indeed have F{n) < F(n°). Since the original assignment n was arbitrary and since all 
assignments n° in (4.2) have the same value F(n°), the lemma is proved. 

By virtue of this lemma, the number k of edges in an undirected graph G with a tree 
partitioning of n nodes into m node sets satisfies 



F(n°) = -(n - m + \)(n - m) + d p (ft - m + 1) + (m - 1 - d p ), 



since a tree on q nodes has precisely q — 1 edges. Thus, 

k < - (n — m + \){n — m) + d p (n — m) + (m — 1) 

= a (ai, m) + d v (n — m), 

with a (n, m) independent of d„. Accordingly, the quantity {a (ft, m) + d ; , (n - m)} is maximized by 
choosing d p as large as possible for fixed n, m: namely, d p = m — 1 (all other dj = 1). In any event, 
then, 

k < - (n - m + l)(n - m) + (w - l)(n - m + 1) 



2 



- (n 2 - ft - 2 + 3m - ft? 2 ), 



whence 

A(ift) s m 2 - 3m - (ft 2 - ft - 2A: - 2) < 0. 
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Now the roots m x and m 2 of h{m) = are m\ = — (3 — y/D) and ra 2 = - (3 + vD), where Z) = 9 + 

4(rc 2 — n — 2k — 2). Because & < - zi(n — 1) then D > 1 whence ra x < 1 and ra 2 ^ 2. It follows 
that h(m) < for all ra with 1 < m < ra 2 . Inasmuch as ra must be a positive integer, then 

3 + V4n 2 - 4n - 8/c + 1 



ra < [ra 2 ] 

Therefore, the first part of the theorem is established. To prove the second half, consider the graph 
G = Gin, ra) which consists of n — ra + 1 "central" nodes, every two of which are joined by an 
edge, and m — 1 "satellite" nodes, each of which is joined by an edge to every central node. 
The number of edges in G is thus 

k = - {n — m + l)(n — ra) + (ra — \){n — m -h 1) 

1 



and the resulting m 2 — - (3 + v(2m — 3) 2 ) = tft. Moreover, this graph on n nodes does admit of a 

tree partitioning with m node sets: namely, choose m — 1 node sets each containing a single satellite 
node, and an rath node set containing all central nodes. Accordingly, the second part of the theorem 
is verified. 

It is worthwhile to note that when the underlying graph is a tree {k = n — 1), the upper bound 
provided by the theorem is exactly n. Here again the upper bound is tight since a tree on n nodes 
admits of a tree partitioning into n node sets (just let each /V, contain a single node). When the 
underlying graph is complete (that is, every pair of distinct nodes is joined by an edge) then a tree 
partitioning can have at most two node sets. In fact, the upper bound provided in the theorem for 

the situation k = — n{n — 1) is seen to be 2 also. As a final illustration, the upper bound on ra is 

calculated to be 17 for the graph depicted in figure 1 (n = 18, k = 33); the tree partitioning of this 
graph shown in figure 2 contains 10 node sets. 



5. Computational Remarks 

The principal virtue of the tree decomposition algorithm given in section 3 is that computations 
need only be performed on arrays which are significantly smaller than the original matrix. 
Accordingly, much larger matrices than could normally be accommodated in core can be inverted. 
Moreover, the form of partitioning employed seems to readily adapt itself to the particular zero- 
nonzero structure of the matrix being studied. Of course, the tree partitioning concept is even more 
appropriate when repeated matrix inversions are to be made for a sequence of matrices differing 
only in that nonzero entries are reestimated or varied parametric ally [4], [9]. Indeed, for a fixed 
structure of sparseness the underlying graph remains the same and so the tree partitioning can be 
found once and for all. In addition, it should be pointed out that TREEPART is at worst an 0{mk) 
algorithm since no edge of the graph need be scanned more than ra times. In practice, the amount 
of computational labor required to find a tree partitioning is really quite modest. For example, when 
the underlying graph has k = 0(ra), as would often be the case for sparse matrices (e.g., resulting 
from the rectangular and triangular lattices which arise in numerical solution of partial differential 
equations), the theorem of section 4 shows that mk ^ (Xn 2 ): accordingly, TREEPART is at worst an 
Oin 2 ) algorithm for these sparse graphs. 

Moreover, the tree decomposition approach is able to exploit effectively the sparsity of the 
original matrix and thus reduce the amount of computation required for matrix inversion. Suppose, 
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for example, that each of the node sets TV,- has |/V,| = r. Then the use of INVERT to find the inverse 

of A requires 0(m 2 r 3 ) = 0(n 2 • —) operations; thus for a given number of nodes, the computational 

m 

effort decreases as the number of node sets increases. Without exploiting sparsity, standard methods 

for finding/^ -1 necessitate ()(/7rV\) operations. Furthermore, the decomposition procedure allows the 

user selectivity in calculating the submatrices X u of A' 1 . Indeed, INVERT only requires the 

calculation of the 3m — 2 submatrices X u which correspond to nonzero Ay in the original matrix^. 

The remaining (m — l)(m — 2) submatrices can be calculated, if desired, during Step 4 of the 

algorithm. Thus, the present decomposition approach would be especially appropriate for problems 

which require finding only certain submatrices of A~ l ; such a situation arises when one is interested 

in finding the variances of estimated coefficients in multiple linear regression, since the required 

variances are derived from the diagonal entries of a matrix (X T X)~ l . 

Several procedures have been described for transforming a given matrix into one with a 

particular partitioned block structure [3], [10], [11], [17], [18, Ch. 3]. The use of a tree partitioning 

seems sufficiently flexible to deal with a wide range of possible partitions. Fiedler [5] discusses a 

type of partitioning more general than that given here; however, the solution method indicated in [S] 

appears to involve too many submatrix inversions to be practically advantageous. Another approach 

to inverting matrices using graph-theoretic concepts has been described by Harary [10]. Such a 

method unfortunately is of little use when the matrix A is irreducible. A generalization of Harary's 

method is presented in [3]. In addition, Mayoh [11] and Steward [17] have discussed techniques for 

permuting the rows and columns of A so that the resulting matrix has a particularly simple form. 

The computational requirements of such techniques are, however, difficult to assess. 



6. References 

[ 1| Churchill, \1. E. A sparse matrix procedure for power systems analysis programs, in Large Sparse Sets of Linear 

Equations, J. k. Reid, Ed., (Academic Press. London, I971). pp. 127-138. 
[ 2| Cottle. K. W. Manifestations of the Schur complement, Linear Algebra and Appl. 8(1974), pp. 189-211. 

[ 3] Dulmage, A. L. and Mendelsohn. N. S. On the inversion of sparse matrices. Math. Comp. 16 (1962). pp. 494-496. 
[ 4] Erisman, A. M. Decomposition methods using sparse matrix techniques with application to certain electrical 

network problems, in Decomposition of Large-Scale Problems, 1). \1. Himmelblau, Ed., (North-Holland 

Publishing Co., Amsterdam, 1973), pp. 69-80. 
[ 5] Fiedler. M. On inverting partitioned matrices, Czechoslovak Math. J. 13 (196)). pp. 571— 586. 
[ 6] Fiedler, M. and Ptak, V., On matrices with non-positive off-diagonal (dements and positive principal minors, 

Czechoslovak Math. J. 12 (1962), pp. 382-400. 
[ 7] Gantmacher. F. H., The Theory of Matrices, Vol. 1, (Chelsea Publishing Co.. New York. I960). 
[ 8] George, A., Nested dissection of a regular finite element mesh, SIAM J. Nnmer. Anal. 10 ( 1973). pp. 345-363. 
[ 9] Gustavson, F. G., Liniger, W. and Willoughby, R., Symbolic generation of an optimal Croul algorithm for sparse 

systems of linear equations, J. Assoc. Comput. Mach. 17 (1970), pp. 87-109. 
[10] Harary, F., A graph theoretic approach to matrix inversion by partitioning, Numer. Math. 4 (1962), pp. 128-135. 
[11] Mayoh, B. H., A graph technique for inverting certain matrices, Math. Comp. 19 (1965), pp. 641—646. 
[12] Parter, S., The use of linear graphs in Gauss elimination, SIAM Rev. 3 (1961), pp. 119—130. 
[13] Reid, J. K.. Ed., Large Sparse Sets of Linear Equations. (Academic Press, London, 1971). 
[14] Rose, D. J., A graph-theoretic study of the numerical solution of sparse positive definite systems of linear 

equations, in Graph Theory and Computing, R. C. Read, Ed., (Academic Press, New York, 1972), pp. 183-217. 
[15] Rose, D. J. and- Willoughby, R. A., Eds., Sparse Matrices and Their Applications, (Plenum Press, New York, 

1972). 
[16] Shier, I). R., A decomposition algorithm for optimality problems in tree-structured networks. Discrete Math. 6 

(1973), pp. 175-189. 
[17] Steward, D. V., Partitioning and tearing systems of equations, J. Soc. Indust. Appl. Math. Ser. B Numer. Anal. 2 

(1965), pp. 345-365. 
[18] Tewarson, R. P., Sparse Matrices, (Academic Press, New York, 1973). 
[19] Willoughby, R. A., Ed., Sparse Matrix Proceedings, IBM Research Report RAl, Yorktown Heights, New York, 

1969. 

(Paper 80B2-439) 
257 



